Goto

Collaborating Authors

 stratified random sampling


Stratified Random Sampling Using Python and Pandas

#artificialintelligence

Sometimes the sample data that data scientists are given does not fit what we know about the wider population data. For example, lets assume that the data science team were given survey data and we noticed that the survey respondents were 60% male and 40% female. In the real world the UK general population is closer to 49.4% male and 50.6% female (source: https://tinyurl.com/43hpe5e4) There could be many explanations for our 60% male sample data. One possibility is that the data collection method might have been flawed.